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GRAPHIC USER INTERFACE FOR icons for each function, to allow distinguishability. As used 

DATABASE SYSTEM in these systems, the graphical programming features arc not 

generally intended to make relatively simple searches more 

The present application is a Continuation of U.S. patent efficient, but rather to allow a programmer to formulate 

application Ser. No. OS/772,650 filed Dec. 23, 1996, the 5 complex reports at a relatively high level, 

entirety of which is expressly incorporated herein by refer- The use of search engines to search databases is known, 

ence U.S. Pat. No. 5,966,126. So is the ability to use Boolean logical connectors to conduct 

prpr n or thf iMVFNJTrnv a search - Thc use °f a set of results of a Boolean search lo 

inc. nNvtfNnuiN performanewsearch,bymodincationofaBooleanoperator 

The present invention relates to the field of graphic user 10 or tfa e addition of new Boolean constraints, is known, 

interfaces for computer systems, and more particularly to The use of Venn diagrams is known in mathematical set 

graphic user interface systems having graphic objects rep- theory. Other graphical representations include histograms, 

resenting data sets,, in which a manipulation of the graphic line graphs, circle graphs, star graphs, matrices, and scatter 

object instructs the system to perform logical or set theory plot diagrams. In general, the applicability of these different 

operations on the data represented by the graphic object. 15 types of graphical representations relies on the ability of a 

p a rvr 15 OTtKrn nc tuc rvn/Evrrmxr user 10 define desired subsets, e.g., using crisp logic, "fuzzv 

BACKGROUND OF THE INVENTION IogiC; graphic regions of the set spacc> or othcr 

Traditionally, graphic user interfaces (hereinafter "GUIs") sucn as a ranking according to a criteria, to define the desired 

have been employed to facilitate interaction of a user with subset. 

the file and execution system of a computer system, to 20 However, it is not well known in the art of front ends for 

present information graphically and/or in a manner approxi- search engines or browsers to use we 11- articulated graphical 

mating a printed page ("what you see is what you get" or representations of set theory, including graphical represen- 

"WYSIWYG''), or to present a desktop metaphor. Of course, tations of the logical operations "AND", "OR" and "NOT," 

there are other aspects of modern GUIs not encompassed by „ and any symbols that are used tend to be static and non- 

this generalization, but which are known to those skilled in " 5 manipulable. 

me art - U.S. Fat. No. 5,551,488, expressly incorporated herein by 

Typical graphic objects in a GUI are icons presented on a reference in the entirety, provides a system in which a 

background or '"desktop", with icons each representing a set graphical representation of a narrowing query set operation, 

of files, application programs, system resources, or the like. , 0 e -g-. a Venn diagram representing two sets within a data 

These icons may be relocated or selected, which instructs the ~ space, is provided within a window. Each window represents 

underlying operating system to relocate the referenced file in 1 data space, and the window includes but a single repre- 

the directory structure, execute or open the file, depending sentation. The search may be modified by selecting areas of 

on its type, or to operate on one referenced file by another the graphic object representation within the window. There 

referenced application. A known paradigm is "drag and 35 is no provision for employing the graphical representation as 

drop", which allows the one or more objects, represented by a» icon. Thus, this system allows the user to visualize the 

one or more icons, to be operated on by another object, also element of the search in a graphical representation, but does 

represented by a graphic object or icon. Such functions not graphically represent the chaining of search criteria, 

include, but are uot limited to, move, copy, execute, filter, U.S. Pat. No. 5,544,267, expressly incorporated herein by 

open, print, and delete. In general, however, the user does ^ reference in the entirety, provides a system in which an 

not operate to modify the icon itself with a corresponding imprecise sketch of a graphical set representation is inputted 

function executed by the operating system or application. and interpreted by a computer system. This system does not 

In addition, within a group of objects, the objects may be provide for interactive modification of the search based on 

ranked, e.g., by name, type, size, date, location, relevance to ^ c graphic representation. 

search criterion, or other attribute. Thus, by defining a 45 U.S. Pat. No. 5,418,950, expressly incorporated herein by 

graphic object representing a set of files, hinctions may be reference in the entirety, relates to a graphic structured query 

performed on the set of files. These known GUIs operate at language ("SQL") query generator system. This system 

a computer or disk operating system level, and thus typically employs graphic models of SQL language commands to 

operate on data sets which represent discrete files. Known generate a report. There is no provision for manipulation of 

GUIs include Apple Macintosh Operating System 7 and 50 configuration or selection of subsidiary objects within 

Microsoft Windows 95 and Windows NT. graphic icons to represent set inclusion criteria. 

Another paradigm for dealing with data in more typical U.S. Pat. No. 5,442,738, expressly incorporated herein by 
databases includes GUIs for database system formulation. In reference in the entirety, provides a system in which a 
these systems, data sets include a plurality of addressable logical and arithmetic relation of objects may be defined and 
records. The database operations are represented graphically 55 manipulated using a GUf. The graphic objects themselves, 
and may be manipulated by the user to formulate a search or however, are not modified in configuration nor are subsid- 
report. In this case, functions may be chained, so that i arv objects within graphic representations selected to rep- 
complex database searches may be formulated by graphic resent a selection of set inclusion criteria, 
representations of elemental operations. These systems do None of the above-cited references formalizes a system of 
not. however, provide a generic icon which is modified lo <so notation, bounded and conventionally stylized, whether for 
reflect a data subset employed. In general, these known display on a computer or otherwise, for description of 
systems are employed in the design of a database system or traditional logical relationships of conventional logic 
customized reports, rather than by a user to formulate a diagrams, or for translation to and from conventional logic 
search during normal use of the system. This is because diagrams. 

these graphic operations represent program steps or objects, 65 Traditional logical symbols are arcane, and do not inm- 

and therefore the resulting complex program must still be itively represent the set'inciusion criteria. Thus, while AND, 

verified and debugged. These systems also present different OR and NOT are readily understood, a complex expression 
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employing such words may become incomprehensible. Con- regions representing a Boolean relationship of two or more 

ventiouai symbols, in which AND is represented by a half data sets. For simplicity, the majority of the present descrip- 

circle, OR by a heavy vertical line. NOT by a circle, tion will focus on the relationship of two sets, with the 

Exclusive OR by a sort of shield, are not imuittve. Likewise, extensions to the representation of three or more sets as 

the use of fl. U and / to represent AND, OR and NOT in set 5 known in the art. 

theory, and a, v, t i a proposition! logic, are also not In a preferred embodiment, a generic icon is provided as 

intuitive. a Venn diagram icon, divided into four set spaces, with two 

intersecting circles in a rectangular box. The space within 

SUMMARY OF THE INVENTION each circle and not intersecting defining two set spaces, the 

tu» , ■ _ ■ * * , ■ m intersection defining a third set space, and the external area 

,n Z T h°H P """f ™ulaung 10 defining a fourth it space. The* set spaces are indepen- 

aua renning a searcn, oy employing a "live graphic icon dently selectable, defining the various binarv operators of 

representing set operators that may be modified and/or the elements of sets A and B, in groups of up to four spaces, 

linked. This representation is preferably bounded and con- as shown in FIG. 6: . 
ventionally stylized, providing a more intuitive language for 

the presentation and use of logical relationships between 15 
elements. The icons are also preferably selected from a small 
group of generic icons, which axe visually modified to 
indicate a set function. 

Aspects of the present invention address the following 
problems of existing front ends for browsers or search 
engines: 

(1) proliferation of results, which are extremely time 
consuming or impossible to consider in their totality; 
and alternatively a paucity of results; 

(2) aggregation of data results, in many cases, in one 
undifferentiated pile; 

(3) loss of the logical history of a search, and the 
corresponding hot links, requiring reentering and 
researching of requests in some cases; 

(4) lack of an intuitively appealing graphic representation 
either of a projected search or the results thereof, 

(5) lack of an intuitively appealing graphic interface for * n rms system, a generic graphic icon is defined having a 
interaction with an "intelligent agent"; boundary and two intersecting areas in the boundary, each 

(6) lack of a suitable graphic interface for manipulation of 35 area repeating a data set, an intersecting portion of the 
index and table of contents data; arcas representing a conjunction of the data sets. The generic 

(7) lack of a suitable graphic interface to handle a search ic ° n thu / f ias four discrete rc S ions ' the P orlion outside lue 
of two or more databases simultaneously; uni0Q of the arcas ' an mtcrscclin S portion of the areas, and 

(8) tack of a suitable graphic interface for handling m EES!! ^ ™ " d ^ ^ T*' A 

complex data searches involving fuzzv logic; and * 40 f^" 0 " f ° f ™ w m ° K t J™ Wkin lhC » 

/nN . , c . . . , ,. 5 * h 1 received trom the user to denne an oumut data set and 

(9) lack of an intuitively appealing interface to represent presenting the generic graphic icon on the GUI as a first 
the results ot a database search. modified graphic icon, having visual indication of the 

(10) Lack of a graphic interface for search processes that selected regions corresponding to the denned output set. In 
is universally understandable, regardless of one s nam- 45 addition, a set inclusion property is defined to correspond to 
ral language. cllc h data set Lor the first modified graphic icon. These 

The present invention therefore addresses these problems, graphic icons may be selected from a pick list or tool bar or 

and provides other advantages, through a new type of system generated from a'template geaeric icon. For example, com- 

for dealing with data sets graphically at a user, rather than monly used relationships may be provided on a tool bar, with 

programmer, level. As envisioned by the present invention, 50 all possibilities available by manipulation of a generic icon 

sets of data within a data space are represented graphically. template, in which selected regions are altered, e.g., change 

In general, the data set represented by the graphic represen- color. The icon may then be used in a larger graphic 

lation are defined by a rule or set of rules. However, a representation of a complex logical definition, by chaining 

particular advantage of the present invention is that a or linking a plurality of icons together. This chaining or 

plurality of these graphic representations, each representing 55 linking process is generally by graphic connector .lines 

a different data set defined by a different rule or set of rules, which connect the inputs and outputs of the iconic repre- 

may be defined and depicted on "the interface, either sen la tion. These connector lines may be smart, i.e., intelli- 

discretely, sequentially or simultaneously. Typically, a user gently routed, spaced and sized in accordance with the 

will seek to use the database system to help define an overall search strategy, graphic presentation and page or 

optimal resulting data subset, wherein the rule or rules 60 presentation layout. 

defining that data set are unknown at the outset or are A preferred Venn diagram icon according to the present 

inefficiently defined as a single consolidated set of rules. invention includes a rectangular box with two overlapping 

Thus, by defining a group of less complex rule sets, and then circles staggered symmetrically along the elongated axis, 

consolidating the rule sets, the desired information may be Each circle is connected to a line intersecting a side of the 

analyzed and extracted. 6 5 box, either spaced through an elongated side or through 

One aspect of the present system therefore presents a opposite short sides, from the outside. A third line emanates 

basic or generic icon, defining a plurality of logical. data from an elongated side of the box, opposite the inputs if 
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these pass through an elongated side. Each Hoe may be within a Venn diagram. The relative size of a circle repre- 

provided with an arrowhead to indicate the nature of the line sents the operator ~for a parameter determining a single set 

as generally being considered an input or an output, with an inclusion rule, and a relative position of the circle represents 

arrowhead pointing inward for an input and outward for an the operator for a parameter deterrainins a multiple set 

output. Hie circle closest to each line corresponds to that 5 inclusion rule. These parameters are generally continuous or 

line. The standard Venn diagram notation is employed, e.g.. stepwise continuous variables, and thus a resizing or lug on 

a change in color or density of a circle to represent selection a circle instructs the system to change a scope of an input or 

of a particular set, with each graphic region representing a output parameter, respectively. In this case, after 

different subset of the data space. manipulation, the icon returns to its original configuration, 

The standard Boolean set operators fail to provide certain 10 so that further relative changes may be applied? During 

useful functions, such as ranges, chronology, ranking, prox- manipulation, a different graphic image may be presented to 

imity and semantic features. These operations are often the user, visually indicative of the change "in progress, 

useful or critical m formulating a desired search. Therefore, For example, a truncation, thesaurus, or relevance rule 

the present invention extends the standard Boolean logic may be applied based on a parametric operator. The set 

with these known search operators to provide potentially full 15 inclusion operator need not be unidimensional, and in the 

functionality within the iconic representation. case of dimensions of the set inclusion or ranking rule, not 

Often, these complex features may be formulated in readily accessible through the icon interface, the user will, 
Boolean terms. For example, a semantic or linguistic analy- for example, double click on the object or portion of the 
sis of a term may provide a number of synonyms and object to display a multidimensional graph, such as a circle 
antonyms, foreign language equivalents, as well as words 20 having a plurality of spaced radially oriented axes. In manv 
related by root. Therefore, a search for a word as a concept instances, a set' of slide controls, each labeled with an 
may be broadened by providing a search based on the associated characteristic for variation, is preferred, and may 
meaning, as expressed through the linguistic or semantic be easily provided using standard GUI software developer's 
analysis. This meaning may be formulated as a Boolean kits. The slide controls may represent parametric or non- 
expression, albeit complex. According to the present 25 parametric variables. 

invention, this complex expression may be represented by a In general, in each case, one or possibly two designated 

single icon, for example a Venn diagram, or chain of default parametric operators will be represented by the 

diagrams, having, for example, an "L ,? next to it, an icon of alteration of the iconic representation, while some or all of 

a person speaking, an icon of a mouth, a Mag indicating the the set inclusion properties may be accessed bv way of the 

country of the language spoken, or a combination thereof, 30 secondary operation. 

indicating extended linguistic analysis. The linguistic analy- In contrast to many types of traditional icons, which are 
sis might also display a representation of parallel linguistic purely graphic elements which signify an underlying pro- 
analysis in different languages, for example, as a set of gram object, the icons of the present invention are active, 
parallel icons, allowing the user to select one or more of the and manipulate to signal to the computer a user input. In the 
outputs. 35 Venn diagram example provided above, the icons arc sen- 
In other cases, traditional Boolean logical expressions sitive to gestures of position and size of internal objects. In 
cannot readily represent the search concept, such as a other instances, the system may be sensitive to other types 
proximity of two terms. Tn this case, a number of alternatives of gestures and employ different graphical representations of 
are present. First, a separate icon may be used which sets. 

represents a distance between two search terms, which may aq A preferred system according to the present invention 

be presented as, for example, a rectangular icon having a provides icons having generally one or two inputs and oue 

number or value in the middle representing the distance, and output, with more complex constructs available by 

an upper and lower arrow representing a desired change in combining, chaining or linking the available icons. Of 

the value. Alternately, an icon representing a set inclusion course, efficiency may be gained in . certain instances by 

property having a parametric modifier may be identified by 45 providing functions with a greater number of inputs and/or 

a "flag" adjacent to the icon being operated on, for example outputs, and where these are commonly used, they may be 

located on top of or pierced by an input or output arrow. This provided as accessible icons. For example, a multiple input 

operator may thus replace the Venn diagram operator, as a OR or AND may be provided. 

separate icon, where the Boolean operation AND is implied, In general, the use of multidimensional icons is preferably 

or act as a modifier for an input or output of a Venn diagram 50 avoided, at least as a default, in order to simplify the 

operator, where full flexibility of Boolean functions are interface and improve the ergonomics of use. Therefore, a 

available. ^ default operation is defined, which will generally be a 

This parametric search modifier may be applied in mul- predetermined variable based on the data types and possible 

liple dimensions, as well. For example, a numeric mean and context sensitivity and artificial intelligence analysis of the 

range may be applied to a search by providing four controls 55 problem. In a system with a local or easily accessible 

over a parameter. This, however, leads to relatively large database, for example, the interface may conduct a sensi- 

icons which appear more like windows, and indeed are tivity analysis to determine the selective. power of various 

generally provided at the program level as windows, without criteria, and present as the default criterion that criterion 

title bars and the like. In this same manner, icons may be with a desired selectivity pattern. Where the database is 

constructed for any particular implication. The goal, 60 remote, difficult or expensive to access, e.g., where it is 

however, is to minimize the number of icon types and desired that access to the database be efficiently used, the 

provide generic icons which may represent a number of selective criterion may be defined by the user or suggested 

different search elements, providing an intuitive visual indi- to the user by the interface. 

cation of the function of the icon. To generate a typical query, producing a resultant data set. 

It may also be desirable to include the parametric func- 65 the user defines a plurality of input sets, and then establishes 

tionality within the Venn diagram operator. In this case, an a logical relationship between the inputs. The input set 

operator is provided with two intersecting areas, e.g., circles definitions may be produced in known manner, such as bv 
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explicit definition, or through use of assistive technologies, performed on the data set. Each set representation includes 
such as natural language translators. The formulated search graphic objects which are ^selectable" and modifiable, 
may then be presented to the user for tuning, fine control or meaning the graphic object mav be operated on separately 
modification. The input criteria may thus be schematic or from other graphic objects, but'more importantly, anv new 
imprecise, and indeed an advantage of the present invention 5 data sets resulting from the set operation are represented by- 
is its ability to graphically assist the user in refining the a secondary (or Tertiary etc.) graphic object which is also 
search - "selectable". Further, these primary and higher order sets 
In a further embodiment according to the iuvention, one can also be graphically merged or re -merged by use of the 
or more set inclusion criteria are associated with a graphic same graphic operators. 

indication of the number, ratio, amount or relationship of set 10 The interface system may, as a default, present ranked set 
members included and/or excluded by the criteria, and the members according to the inclusion criteria, with a weight- 
relationship of one set criterion with another criterion. For ing test where multivariable inclusion criteria are employed, 
example, two criteria may be redundant, e.g.. highly corre- The user then may modify !his ranking by, for example, 
lated. Therefore, one of the criteria may be eliminated, opening a text dialog box or a graphic window. In certain 
simplifying the search. Further, by analyzing the search 15 instances, the ranking criterion may be defined by graphi- 
parameters, the system may be able to assist the user in cally manipulating an icon or graphic image in correspond- 
formulating other searches which meet the user's criteria, for ing mariner to the set inclusion criterion, while in other 
example changing the scope of the search or applying a instances it is preferably defined explicitly bv the user. Thus, 
corresponding search to another database. with multivariate set inclusion criteria* the user may be 

As staled above, it is preferred that complex searches be 20 presented with a star graph icon for defining the relative 

formulated generally employing simple binary operations. weighting of each of the multiple variables in the rankings. 

The present invention therefore allows chaining of the When ranking data, a multiple key ranking system maybe 

biliary representations, in a tree format, to achieve these implemented with weightings applied to the different factors 

complex results or transfer functions. This allows the user to or combinations of factors. Thus, when booking airline 

view the formulation of the search and to modify any 25 tickets, the various criteria mav be ranked, and an" optimal 

element within the formulation, which may immediately available flight booked. The" criteria are, for example, 

update the entire search structure or selectively update based availability, price, airport, schedule, carrier, availability of a 

on relevance or another criterion. block of seats, as necessary, mode of transportation, 

In providing feedback or output to the user, a number of amenities, frequent flier perks,' and seat position. The corn- 
strategies may be used, or hybrids of the strategies, as 30 puter system retrieves information from one or more carrier 
appropriate. For example, in an on-line database, there may databases, such as AA Sabre, and then conducts a negotia- 
be charges, access fees or latencies for each search. Hon based on a strategy defined by the user. This negotiation 
Therefore, in this case, it is desired to formulate a search strategy may be adaptive, explicitly defined by the user, or 
entirely before executing; on the other hand, the results of a based on an artificial intelligence system embedded iu the 
search or a portion of the search may be readily available, 35 interface guided by the user. The various criteria may, for 
and such may be updated immediately. In some instances, a example, be presented to the user as a group of graphic 
portion of jhe database may be readily accessible, and a boxes which the user stacks in order to define a relative 
portion difficult to access, and these search requests may be importance of the various criteria. Alternately, a set of slider 
scheduled and/or dispatched accordingly. Further, results controls in a dialog box or window may be presented to 
may be returned in various sequences, and these may be 40 allow the user to weight each criterion independently, 
presented to the user in a non-confusing manner. In some In some instances, portions of the data may be hierarchi- 
instances, where database access is low cost, but latencies cally ranked, such as patent classifications, prerequisite trees 
are high, such as with a number of Internet based resources, for some college level classes, and professional and manu- 
it may be advantageous for the interface system to dispatch facturing directories. Therefore, this hierarchy may be used 
searches in advance of a final determination, which are 45 to define the desired set, rank the resulting members of the 
predicted to be of at least the desired scope. These may then set and derive cutoffs. 

be narrowed or sorted with less penalty, and with a shorter Where data is hierarchical, it is possible that one defined 
latency from the finalization of the search criteria. set is a subset of another defined set. Therefore, the inter- 
ior example, in a text proximity search, an initial search section of these sets is the subset. When this happens, certain 
with an AND operator may be initiated, with the resulting 50 of the possible functions become trivial. Therefore, where 
"hits" subjected to a proximity filter. the interface system is able to detect this condition, or other 
In using the system, the user will normally interact with conditions which make some of the interface functions 
the interface based on two possible scenarios; first, that a trivial or inoperative, such as an eraptv defined set or 
finite "correct" set of results exists, and the GUI database intersection set, a set definition which encompasses the 
system is used to assist in obtaining the correct result, and 55 entire data space, redundant set definitions, etc., the interface 
second that no single "correct" set of results exists, and the may warn the user to avoid inefficiency. Of course, there are 
GUI database system is used to define thresholds, ranges, instances where this information will "be unavailable to the 
boundaries, grey zones, rankings and so forth, so as to map interface prior to execution of a search, in which case the 
a search strategy to select an acceptable result or range of interface will perform its normal functions, even with these 
results. In the former case, the user may be able to determine 60 trivia lizations. 

the "correct" result when it is presented, while in the latter There are many different ways to graphically represent 

case, the user relies on the search strategy to present alt data data and sets of data, and the present invention therefore 

which meet the search criteria. preferably allows use of various optimized graphic repre- 

In either case, the present invention provides a graphic sentations of the underlying data sets and set operations, 

object which is manipulated, for example, portions selected, 65 depending on the preferences of the user and the context, 

repositioned or resized on the display with respect to the These graphic representations may be applied both for 

other graphic objects, to represent a set operation to be inputting and outputling data, and indeed the input and 
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output representations need not be the same. Just as in indication of the user. Thus, in a full text proximitv search- 
descriptive statistics, form follows function. For example, a able system, search criteria A AND B mi^ht, tor example, be 
simple two dimensional line graph may be adequate to narrowed in the order of A (same section) B, A (same 
describe two variables but not, for example, five. One paragraph) B, A (same sentence) B, A (adjacent) B. 
embodiment of the present invention will advise the user, or 5 Alternately, the word spacing between A AND B may be 
select by default for the user, optimal graphical representa- narrowed bv successively reducing the operator, A(wiihin n 
tions of a given problem. words of) B. Using artificial intelligence, these various 
A common method of graphically representing set schemes may be intermixed, and indeed, the search strategy 
operations, and a preferred representation according to the maybe tuned based on a number of reported hits; if too low, 
present invention, is the Venn diagram, wherein the data 10 the strategy is made less restrictive, if too high, it is made 
space is represented by a plane, and sets in the data space more restrictive, according to the hierarchy of the search, 
represented by contiguous bounded areas on the plane. Set As stated above, the present invention also allows search 
operations are represented by the intersection of lines and restrictions based on secondary criteria, such as rankings, 
the overlap of spaces denned by the contiguous bounded These rankings may be included with the proximitv opera- 
areas on the plane. In this case, an overlap of two areas 15 tors. Thus, it can be seen that the present invention \nav 
represents an AND, while the combined area bounded by include an intelligent interface which selects various avail'- 
each "disjoint" (separate) area represents an OR operation. able searching tools in order to refine a search. If these 
Using unitary Venn diagrams, more complex logic is less refinements are inappropriate, the intelligent interface mav 
easily represented, i.e., while such representations arc be adaptive to the user, and learn a desired search strategy, 
possible, ihey may require display systems with greater than :u which may be applied globally, for a particular user, "or 
two dimensions for greater than three sets. However, by during a particular search session. 

chaining of binary operations, more complex processes can In another example, a tvpical full text search using 

be readily presented on a single 2-D screen or page; indeed, relevance based search technology will result in a ranked list 

such chaining is fundamental to the present invention. of "hits", with presumably most relevant information at the 

The present system is not limited to searching data sets 25 top of the list and less relevant information at the bottom of 
using crisp logic, and may therefore include fuzzy logic the list. This ranking methodology is known in the art. When 
searches having corresponding set inclusion properties and a search is modified, essentially a new search must be 
set member rankings. In this case, the manipulation of conducted with a new relevance ranked list generated. If this 
graphic objects need not be limited to binary or crisp logic ranked list is represented graphically, a different set of 
relationships, and therefore an amount or proportionality of 30 inclusion rules may be defined, resulting in a different data 
movement may be used to indicate a set inclusion property set, represented byanother graphic object. Since each set or 
or set function. Thus, the Venn diagram operator may be criterion may be ranked, it" might be concluded that over- 
used to represent fuzzy logic set inclusion properties and lapping items higher on the list have greater significance 
gestures used to alter the boundaries of a fuzzy set or logical than overlapping items lower in the list. Alternately, the 
operations between fuzzy sets. 35 ranking of the overlapping data may be redetermined based 

Thus, typically, a search of a full text database or other on a composite ranking algorithm, such as a weighted 

types of complex data, including images, audio information, average paradigm. In either case, this ranking mav be 

scientific data, industrial data, financial data, etc., does not graphically represented, and the user may apply a threshold 

contemplate a single, finite, and final response; rather a function to the graphically presented data to limit the data 

broad search is conducted from which the resulting data set 40 set. 

is further optimized, based on the application of further rules A ranking of a resulting set from a search mav advanta- 
or manual review of the records. Therefore, a search strategy geously be graphically represented, in order to allow tuuiu? 
for these types of situations often proceeds with an expec- of the underlying search strategy or the ranking algorithm! 
tation of a number or clustering of relevant data records to A trained user will, for example, view a sample of. the 
be extracted from the data space, with the search strategy 45 resulting data sets to determine whether the search strategy 
refined until the expectation is met or the expectation returns the desired information, and if so, with what degree 
altered. In this case, by providing a graphical representation of quality. On the other hand, a user mav have no or little 
of the data sets in an optimized format, the set inclusion basis for making such a determination." In these various 
characteristics may be easily modified, or the modifications instances, the searching system may apply a range of rules, 
reversed or respecified, using the GUI, for example by using 50 from those narrowly defined by the user* 10 broad generic 
a graphical pointing device to shrink or expand sets or rules devised to adequately encompass many different 
intersections of sets. In other words, according to various searching scenarios. A user may review a "list of set 
embodiments of the invention, each portion or region of the membership, and manually include, exclude or rank mem- 
graphical representation is potentially the subject of selec- bers. The database system then formulates an "intelligent 
lion or configurational modification, representing a desired 55 agent" based on the actions of the user, which extracts a 
alteration in the corresponding set inclusion criteria. ranking or inclusion rule to complete the task or to search 
It is noted that, for a given set, the graphical representa- other data spaces for relevant data or continues the task lor 
tion of refining set inclusion criteria need not be a multiset the remainder of the data. 

representation and therefore may be a separate representa- In order to rank set members by multiple criteria, a 

tion from the Venn diagram icon. Where binary set opera- GO weighting function mav be appUed to the different criteria, 

lions are chained, and each input to a successive analysis is According to the present invention, this weighting function 

considered a set, an attempt to modify a composite set must may be indicated graphically. It is noted that the ranking or 

be analyzed to determine the significance of a change to the weighting functions are not constrained to have any rela- 

set of inclusion criteria for the composite set. Artificial tionship with the set inclusion criteria, although in" many 

intelligence techniques may be applied to analyze the com- 65 instances this will be the case. 

posite set inclusion criteria, and to propose an analogous When ranking a set member, a primary set inclusion 

change to the search which corresponds to the gesture or criterion is fulfilled, and generally only then is a secondary 
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criterion applied to order the set members. In this case, the a circle with a radius may be graphically presented to the 

ranking is generally applied where the next stage of the user. The user manipulates the set inclusion criteria by 

process truncates the first set based on this ranking, for relatively varying the radius; an increase in radius broaden- 

example taking only a certain number of sei members or ing the search and a narrowing of the radius closing the 

those having a ranking score above a certain level. 5 search. Of course, a line, polygon, or other graphic indicia 

It is important to understand that, in cases where the may be used instead of a circle. In fact, where multiple 

search is simple and the user has a clear expectation of the criteria are employed, multiple axes mav be presented, such 

desired results, the advantages of the present invention are as in a star graph. Alternately, a set of visual slide controls, 

primarily provided by the efficiencies gained by employing analogous to the board of a mixing preamplifier, may be 

a visually intuitive set theory paradigm interface, and the 10 used to control multi-valued variables, while on-off tougles 

searching technique itself will likely parallel a typical prior may be used to control binary variables, 

an search i.e., the user will apply specific and concrete In general, it is useful to provide default operators which 

search criteria, explicitly tailoring the search to the desired represent a most likely or best choice for a user. Thus, for 

scope, rather than relying on graphic tools to assist in each user action, the "system preferablv provides a default 

defining the search scope. On the other hand, with complex 15 which may be predetermined or adapt'ive, and which may 

search strategies or where an accurate expectation of a differ for each user action. Typicallv, the default operations 

"correct" set of search results is unavailable, the system and may be accepted by the user with a minimum of action, 

method according to the present invention allows the user to while an override is available which mav require further 

tune the search in a new manner, based on the set theory keystrokes or pointer activity. 

paradigm. In some instances, the result itself may differ fiom 20 In order to gesture to the interface an intent to tune a set 

the results available through known searching strategies inclusion property, the particular set is selected, and then the 

from existing systems. 'Hie present invention provides a set gesture made. Thus, since each input set is signified by a 

of graphic tools and raanipulable icons to provide a graphic bounded region in a Venn diagram, the border of the 

correspondence between the manipulation of the graphic bounded region may be enlarged or contracted to alter the set 

elements of the user interface by the user and the change in 25 inclusion property.' Further, anv region or regions may be 

search scope and/or results presentation strategy defined by selected for attempted manipulation, including the region 

the interface. outside the defined set spaces, the intersection and non- 

Often, a broad search seeks set members which corre- overlapping portion of each set. Likewise, the ranking of the 

spond to a concept, the search strategy being used to define each region in the Venn diagram, or composite regions, may 

the concept. These concepts may be unitary and indivisible, 30 be ranked. The gesture mav also pertain to ~a relation 

or complex and multifactorial. Therefore, real world search- between two sets, such as a Boolean relationship or a change 

ing often requires surrogate criteria, estimations, cutoffs and in a parameter. Therefore, not only mav the size of a region 

simplifying assumptions. Where a concept is clearly defin- he manipulated, but also the relative sparine and arranqe- 

able and the data readily analyzed to determine a match to mcnt of the spaces. 

the concept, the definition of this set is clear, and no 35 Imp licit in certain of the modifications discussed herein is 

manipulation is necessary. On the other hand, the surrogate that the user is provided with feedback as to the effect or 

criteria, estimations, cutoffs and simplifying assumptions expected effect of a particular modification. In this case, anv 

often lead to errors, i.e., loss of significant set members or of the known metric indicators may be employed to indicate 

inclusion or erroneous set members, or both. In a typical to the user the number of elements in the selected set. For 

search strategy, it is often desired that the initial search 40 example, a digital indicator, linear gauge, size of a bounded 

criteria be overinclusive, so that later the search may be region, color, color density, or other known indicator scheme 

narrowed in specific ways. Thus, the surrogate criteria, may be used. Thus, the user graphically manipulates the icon 

estimations, cutoffs and simplifying assumptions should until a desired metric is achieved or is expected to be 

generally be defined so as to over-encompass the desired achieved. While a user might also directiv indicate the 

data. Sometimes, multiple underinciusive set definitions are 45 desired metric, often, the data does not have 'a linear distri- 

uscd to define the concept, either because the search tools do bution or other simple distribution. In this case, the user can 

not allow a single criterion of sufficient breadth, or because "tweak" the set inclusion property until a desired portion of 

such a single criterion would result in the inclusion of a high the set is encompassed. Further, 'the svstem, in additiou to 

proportion of undesired set elements without a correspond- presenting a membership metric of a desired set inclusivitv, 

ing increase in desired set elements. 50 may also present a membership metric of an undesired set 

For each set definition which employs surrogate criteria, inclusivitv, allowing the user to define a desired "signal to 

estimations, cutoffs and simplifying assumptions, it there- noise ratio". In these cases, merely entering a desired size of 

fore may be desired to be able to tune the set inclusion a set has the same narrowing effect as defining a ranking 

properties and/or rank the set membership. One way to do criterion and setting a threshold cutoff of the ranked set; the 

this, for example with full text string searching, is with 55 rules for such a narrowing may be explicit or implicit'and 

proximity operators. For numeric data, intervals, ranges and predetermined or learned. 

statistical functions may be employed. For other types of The user may modify the graphical representation by 

data, other criteria may be employed, as appropriate for that changing a size, shape, 'position, color, texture, sound or 

type of data. The result of the identification of a continuous other characteristics of the represented data set. This change 

or stepwise continuous parameter which has a presumed or 60 is then reflected by performing a set operation on the data set 

defined relationship with the concept is that the set inclusion in the data space' which corresponds to the manipulation, 

criteria may be modified in a desired manner by altering this Preferably, a metric indicating the effect of the proposed 

parameter. In any given case, one or more parameters may change is outputted, but in some cases, the search result 

be appropriate. In a single parameter system, it is believed itself may be presented. 

that a single axis may be used to define more inclusiveness 65 In a Venn diagram representation, a boundary represents 

and less inclusiveness, with additional axes required for the set inclusion property limitation, and thus data elements 

additional criteria. Thus, for a single set inclusion property, which meet the set inclusion propertv are within the 
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bounded area and data elerueats which do not meet the set contextual analysis of the data sets or data space. The 

inclusion criteria are outside the bounded area. In general, interface may present the user with a dialog box to receive 

the position and the shape of the area and boundary will be specific guidance. Of course, a combination of these strat- 

arbitrary, but in certain instances, the data space is ordered cgies may also be used. 

and the set inclusion criteria define a data set with a "shape" 5 Another aspect of the present invention provides a rank- 
within the data space. Where two data sets are defined within m S algorithm based on an extrinsic database, such as a 
the data space, elements common to both sets will be in an known citation index, such as the Science Citation Index or 
overlapping area bouuded by both boundaries; there mav be Social Science Index, which includes major scholarly- 
data elements bounded bv one or the other boundaries/and references, with indication of the identity of the authors, 
there raav be data elements bounded bv neither boundary. 10 affiUated institution, journal, cited references, title, and 
Where the data sets are drawn from different data spaces, the P^sjbly abstract and key words. Such an index, when 
representation mav be different, although in most instances 7P lo > ed [° rank relevance or importance of the results 
the desired re omental ion bv user will be the sum of hoth ° f a "f* m . ano . ther d ? labaSe ' P[? vid « a sophisticated 

data spaces as the presented composite data space. * ° C -^T^jf^?' a ?* 

c„„. m .i.. M , A i a !s j uuiaa ^ a ~r' various topics will yield hundreds or thousands of "hits", n 

For example set A may be defined as word X within n 15 lhis case? it h desired [Q lhe ^ ^ a { ^ Qrd 

words ot word Yin a fill text database search. The initial so thal relevance or imporlance declines as * he Ust ^ 

search may be performed with n=10. In this case, a change reviewed in order, allowing a truncation of review of the 

in size of the boundary may be interpreted as changing n, so search results without reviewing all of the references and 

that a larger boundary increases n while a smaller boundary allowing a cutoff to be less arbitrarily imposed. Thus, for 

decreases n. The number of set elements which meet the set 20 example, if literature review references are desired, the sort 

inclusion criteria may be indicated graphically or or rank criterion is number of cited references. If scientific 

numerically, to allow feedback as to the effect of the importance is desired, then the number of citations to that 

modification. In like manner, a multidimensional change in reference is the sort or rank criterion. Other criteria which 

size or metric (e.g., shape) may be implemented where the may be employed include the importance of the journal, 

set inclusion criteria are more complex. On the other hand, 25 which may be defined by the user or derived from a 

where two sets, A and B, are defined, their spatial relation- statistical analysis of the citation database itself, the impor- 

ship in the graphic representation may be interpreted as their lance of the institution(s) with which the authors are 

degree ofoverlap. By changing the spatial relationship of the affiliated, the number of cites to articles by an author, the 

representations of set A and B, the set inclusion criteria for relationship between a number of authorities referred to or 

the underlying sets as well as their union and intersection 30 instances of citation by an author and the number of authori- 

may be controlled. In some instances, this is impossible, for ties referring to the author or instances of citation to thai 

example where the represented set inclusion criteria do not author, or any other single criterion or hybrid of multiple 

include any continuous or stepwise continuous formulation. criteria. 

In this case, the user interface rejects an attempt to graphi- In addition to citation indexes, a number of other extrinsic 

cally modify the configuration of the graph, and a form or 35 knowledge sources are available for ranking of data sets. For 

error message indicated to the user. For such a change to be example, as appropriate, electronic dictionaries, encyclope- 

made, the user must alter the underlying concrete set inciu- dias and reference works, may be used for ranking electronic 

sion criteria or define a subsidiary ranking method. On the data. Internet or other electronic catalogs, uews reports, wire 

other hand, the overlap between two sets may have an news feeds, historical (including natural history) materials 

element of gradation, and therefore the positional manipu- 40 also provide useful sources of ranking information. In par- 

lation would be interpreted as changing a threshold, ranking licular instances, business data and data mining systems may 

criterion, set inclusion property, or another aspect, as pos- be used, for example retrieving business inventory, orders, 

sible given the underlying set inclusion properties. operational processes, personnel, performance, "commis- 

Where a number of possible interpretations of a manipu- sions or royalty information. In searches relating to intel- 

lation are possible, the interface may guess the most appro- 45 lectual property, patent, trademark, copyright, other legal or 

priate interpretation or seek elucidation from the user. Thus, technical information databases may be used. Likewise, 

the user may enter set inclusion criteria which result in 100 information on agriculture, veterinary science, animal 

"hits" or data records or portions of records which corre- husbandry, lineage (documents), bloodlines or breeding, 

spond to the criteria. The user, however, may seek an output physical performance, etc. may be used to rank information 

including only 50 such hits. Therefore, the user selects the 50 relating to ihese areas. In fact, such ranking need not be 

bounded region of the GUI which corresponds to the desired semantic in nature, and various types of correlation to 

hits, and graphically seeks to resize the area. Where the image, sound or other data maybe used where appropriate, 

search criterion includes a single continuous or stepwise News reports, biographical information, and news pictures 

continuous parameter, the interface may interpret the resiz- may be used in relation to persons and events. Archives of 

ing as a command to vary this parameter. Where the result- 55 musical, architectural, artistic or theatrical material, or other 

ing data set is ranked, i.e., where there is an algorithm for materials of the creative arts may also include information 

determining the relevance or importance of elements which useful for data ranking. Statistical, demographic, actuarial, 

meet the set inclusion property, the resizing may also be geographic, cartographic, census, or projections of future 

used to vary a threshold, below which the data is not numbers may be used for ranking in some cases. Recipes, 

considered within the boundary. Where there are a number CO formulas, and chemical or industrial process information are 

of possible analogies to the resizing, the interface may seek other types of information sources. Thus, it is apparent that, 

specific guidance from the user, have different resizing according to the present invention, data may be ranked in 

commands, employ a multidimensional resizing command, accordance with a relation with information in a separate 

with the various dimensions representing the various data base, which provides a standard of relevance. The 

degrees of freedom to modify the set inclusion properties, or 65 ranking database and method of ranking % may be automati- 

the interface may guess the correct interpretation, using a set cally selected based on availability and data type or semantic 

of artificial intelligence rules, an adaptive interface, or terms, or manually selected by the user. 
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The invention also contemplates the use of "blockmod- configuration of the graphic image, altering at least one of a 

elling" (or similar technique) to establish hierarchies and size, shape and position of at ieasl one element of the graphic 

represent them visually. See, Harrison C. White et aL, image; and translating the manipulation of configuration as 

American Journal of Sociology, Vol SL(4), pp. 73O-7S0 a modified composite set inclusion criterion. A database 

(1993), expressly incorporated herein by reference. 5 operation may then be executed corresponding to the modi- 

AJthough these blockmodelling techniques are known, their composite set inclusion criterion, 

application to GUIs, data browsers and database search It is another object to provide a graphic interface system 

engines, as an automated tool to establish apparent relevance I° r manipulation by a user of data sets in a database, 

or authority, is not known. Blockmodelling is defined herein comprising a graphic output; a user input having at least one 

as any technique that uses an algorithm to establish ordered 10 substantially continuous degree of freedom corresponding to 

matrices of the relations (such as "like/neutral- dislike*" or "A said S ra P hic 0U W a processor, for receiving the user input 

knows B, B doesn't know A; A doesn't know B, B knows /V and grating the graphic output; and a database, compris- 

A and B know each other, A and B do cot know each other") lI ? g a P lurallt y of daia records each hav, ng at least two 

among social objects (such as publications). Such a lech- <* a ^*™ics, . th 5 P ro " ssor receivin S . at ! eas < * vo *' 

nique can be used to establish, probabilistically, who are the 15 ^^nll^Tr ? ^ K ?™ nlm \ 10 ! he * ra ?. hlc 

members of a scholarly specialty and who has the most X! Z 1 h r f * f 7 ™^ " * 

. „ . , , , , least two regions having a spatial relation m a space, 

apparent authority within the specially, based on the corresponding 10 a composite £1 inclusion criteria; reeeiv- 

relations or citation. ing the user input for man inula ting a con fieu ration of the 

nius, while such systems have certain limitations, they graphic image, altering at least one of a size, shape and 

provide additional tools and opportunities for ranking of 20 position of at least one element of the graphic image, the set 

references. The use of citation indexes is especially useful inclusion criteria, for each manipulation, corresponding to 

because databases of such information are available on-line the degrees of freedom; and translating the manipulation of 

and on CD-ROMs, making implementation possible without configuration as a modified composite set inclusion crite- 

having to amass this data separately for implementation of rion. 

the interface system. 25 It is a further object of the invention to provide a GUI 

Likewise, where relevant, Internet Usenet postings pro- method for representing a search of a database, comprising 

vide similar opportunities, such as by employing "DejaN- the steps of defining a generic graphic icon having a bound- 

ews" (http://www.dej anews.com), "Alta Vista" (hup:// ary and two intersecting areas in the boundary, each area 

wvvw.altavista .digital .com) or other search engines which representing a data set, an intersecting portion of the areas 

allows a search for all posts by an author. In this case, 30 representing a conjunction of the data sets, and having four 

institution information may be less reliable however r domain discrete regions, the portion outside the union of the areas, 

name may be useful information. Further, the "citations" of an intersecting portion of the areas, and portions within one 

the citation indices on the Internet Usenet correspond to the area and not within the other area; receiving from the user 

posting threads, which allow related postings to be grouped a selection of one or more regions within the boundary to 

together. Of course, other types of mass indexed databases 35 define an output data set and presenting the generic graphic 

may be used to rank the set outputs, where the set space has i con °Q the GUI as a first modified graphic icon, having 

a significant relation to the database from which the index is visual indication of the selected regions corresponding to the 

created. defined output set; defining a set inclusion property to 

While for many envisioned set definitions, a Venn correspond to each data set for the first modified graphic 

diagram, defining Boolean logic spaces is sufficient, in other 40 ic °n; receiving from the user a selection of one or more 

cases, other data relationships may be desired or required. regions within the boundary to define an output data set and 

Thus, alternate or additional symbols may be employed. For presenting the generic graphic icon on the GUI as a second 

example, a lime delay or relative lime difference may be modified graphic icon, having visual indication of the 

represented by a clock in a box-icon; exclusive or (XOR) selected regions corresponding to the defined output sei; 

represented by two circles with one or two lines separating 45 defining a set inclusion property to correspond to each data 

them, in a rectangular box; stop indicated by an open hand set ror the second modified graphic icon, one of the set 

pointing up, "go forward" and "go backward" indicated by inclusion properties of the second modified graphic icon 

a hand pointing to the left or to the right, respectively, and the defined output set of the first modified graphic 

a merger or mixing represented by a funnel. icon; outputting the first and second modified graphic icons 

More generally, the BGI is well suited for integration with 50 together on the GUI to represent the composite set inclusion 

other pictographic symbols that are intuitively clear, such properties of the selected regions of the second graphic icon, 

that the GUI is readily understood and remembered by users, Another object of the present invention is to provide a 

regardless of natural language, and without extensive prac- method for manipulating data sets in a data space, compris- 

tice or training. For example, in chemistry, the addition of ing the steps of providing a GUI of a computer system; 

liquid could be represented by a funnel; in electronics, a 55 defining at least two data sets in the data space, each of said 

step-up transformer could be represented by a set of stairs data sets being represented by a primary graphic object 

going upward from left to right; in cooking, a straining presented on said GUI; and manually altering a spatial 

process by a strainer. relationship of said primary graphic objects through said 

ORTFPT^ of top NVPvnnM GUI * 10 mocuf y a secondary graphic object representing a 

UDJtL13 ur int iwvtiNuutN fi0 result of a set operation on said data sets represented by said 

It is therefore an object of the invention 10 provide a primary graphic objects, the graphic structures and the 

graphic interface method for manipulation of data sets in a secondary graphic objects each defining bounded regions 

database, comprising the steps of defining at least two sets selectable through the GUI. 

having distinct set inclusion criteria; representing the at least Further objects of the invention are to provide a Boolean 

two sets as a graphic image of at least two regions in a space 65 graphical interface ("BGI") system as a "front end" tor 

having a spatiaUelation, representing a composite set inclu- searching mass databases, having one or more of the fol- 

sion criterion; receiving a user input for manipulating a lowing features: 
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a. Boolean logical connectors, such as AND, OR, and include not only textual information, but also other infor- 
NOT, represented graphically by "Boole-graphs", mation forms, such as auditorv. visual, olfactorv, or texmrai 
including Venn diagrams., histograms, circle graphs, or touch inputs, and combinations thereof. 

time Unes, clocks geographical maps, graphical repre- and 0{hef o5jects and features of lhe oresem 

semations of AxBx . . . N matrices and tree structures. 5 invention will become more fullv apparent from the follow. 

b. Graphical representation of the entire search, with ing description and appended claims taken in conjunction 

Z'lil t^T ialC Tl th ? u wilh Lhc «om|«nytag drawings, in which like numerals 

backward and forward througn the Boole-graphs with- rct - er t0 ^ ' 

I'M it IACC r\ r I nfrtrm 1 1 lr\rt r\ r no »A t-r* no rff\ »•« -x ~,, r " 



out any loss of information or need to perform a new 
search 



BRIEF DESCRIPTION OF THE DRAWINGS 



c. use of typical GUI gestures to interact with the system, 
such as point and click, drag, etc., to program the search FIG. 1A is a comparative chart of various forms of logical 
strategy, while maintaining intermediate search results. notation, including a set of logical notation symbols accord- 
Within graphic representations, an outlined section of a ing to the present invention; 

Boole-Graph may be selected as a domain for a further FIG. IB is a chart of sample pictographic svmbota or 

search or dragged together with or separated trom other icons according t0 lhe preS eat inwntion; 

outlined portions. r 

d. an indication of an output metric is provided which is FI °- lC lIlustrates lhe interaction of the icons; 
preferably analog, and may be non-visual, such as aural ^IG. ID illustrates some possible ways to signify the 
or textural. The set inclusion functions arc preferablv presence of set elements within a region; 

indicated graphically. ' 20 FIG. 2A is a table according to the prior art; 

J X ?^*°?r?°?r{ ° bje , Ct ° f inVent L° n ' Tnrr FIG - 2B is a l0 § ic representation diagram which extends 

are embodied to a computer program wherein the BGI lhc • r art ' 

system is implemented as an "applet" or module that can be J\ 

filly or partially downloaded from a computer network such flGS. 3 A and 3B are alternate graphic search represen- 

as the Internet, for use by many different computer 25 tations according to the present invention, employing Venn 

platforms, to the extent that* the applet is compiled in a logic diagram notation of the search represented in FIGS. 2 A 

common code that these platforms are enabled to interpret or and 

execute. Such applets may be provided as JAVA or ActiveX FIG. 4 is a conventional logic notation diagram of the 

constructs. The BGI may be attached to a browser and/or search represented in FIG. 2; and 

search engine facility, or can be so attached, so that the parts 30 FIGS. 5A and 5B represent elementary iconic symbols of 

work as an integral unit. the Boolean Graphic Interface according to the present 

According to a further object of the invention, the BGI invention, having two and four inputs, respectively; 

^in^r^ b nf"° f d H ?1ay ^ ^ a !£: he W ? rk ? f an FIG. 6 shows various alternative representations accord- 
in teuigent agent residmg either in the BGI or elsewhere. • lQ thc t iflveQtioQ different f aled b ^ { { 
For example, to periodically update a search and show 35 SV mbolism 
"before" and "after" Boole-graphs. Or to show Boole-graphs " * * . 

representing the empirical results of two different negotiat- HG ' 7 shows aD uscr 1Qterface screeQ according to lhe 

ing stvles followed by the intelligent agent " present invention showing a multi-criteria search and 

According to a still further object of the invention, the Sophie indicators of search results; 

BGI system has a interface that is optionally adaptive, that 40 FIGS. SA(1) and 8A(2), 8B, 8C, 8D and 8E show, 

is, the user can choose to allow the computer, either always respectively, a set of parametric and non-parametric variable 

or in certain instances, to make conclusions about a suitable icons, a Veun diagram icon employed in variable alteration 

appearance or functionality of the interface, based on infer- mode, a parametric icon (e.g., word proximity) as a post 

ences about some or all activity of the user. The adaptivity modifier for a Venn diagram icon, a multidimensional 

may be controlled using graphical set representations 45 median and range operator parametric icon and a dialog box 

according to the present invention. witn slider controls; 

According to an additional object of the invention, the FIG. 9 shows different icons for displaying ranked outputs 

BGI system allows the user to interact with multiple data- according to the present invention; and 

bases and/Or search engines simultaneously, where each FIG. 10 shows a computer displav screen having a split 

separate search is capable of being displayed by a separate 50 screen showing Boolean graphic notation, conventional 

Boole-graph, and where the elements of each Boole-graph Boolean notation of a set inclusion criterion according to the 

can be dragged to separate Boole-graphs, for example, to present invention and a tabular format, 
test cross-intersections. 

According to another object of the invention, the BGI DETAILED DESCRIPTION OF THE 

system displays especially pertinent forms of the BGI, for 55 PREFERRED EMBODIMENTS 
example as chosen from icons, so that improved efficiency 

in user interaction is gained. For example, special BG'l ^ inventioa wili n° w be described by way of the 

optional environments for travel reservations, concert drawings, in which corresponding reference numerals indi- 

bookings, employer or employee wanted, searching for a cale corresponding structures in thc figures, 

qualified professional, other human resources functions, 60 As shown in FIGS. 1A-D, there are various symbolic 

match-making, indexing and tables of contents, self-guided notations for logical functions. A preferred notation accord - 

learning, scheduling, shopping, navigating, web browsing, m S lo l h e present invention is a Venn diagram, column E, of 

the various scientific disciplines, such as chemistry, physics, FIG. 1A which in compact and standardized form provides 

biology, electronics, mechanics, etc., cooking, and other an intuitive representation of the various relationships 

common or specialized activities. 65 between two data sets. 

According to a still further object of the invention, thc As shown in FIGS. 2A and 2B, conventional notation 

system includes the ability to search databases which requires careful study of the words or translation of the 
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symbols into set relationships for comprehension of the relevance, as shown in FIG. 9. These icons may represent a 

symbolic notaticu. Where words are employed, the eSicien- data filter for ordering or ranking the data according to a 

cies of a symbolic representation are lost, and complex desired characteristic and possibly apply a threshold or 

expressions are difficult to comprehend cutoff. The flexible output icons may use, for example, either 

On the other hand, FIGS. 3A-B provide a clear repre- 5 of lhe lwo v *sual representations of the relevance' of "hits" 

sentation of the set relationships of each graphic icon or shown in nG - 9: concentric circles 91, 92. 93, 94, 95, or 

symbol as a set operator, according to the preseut invention. layers of a P> Tarnid 96 ? 97 > 9 $> 99, 100, 101 or triangle 102, 

This is because the symbolic notation includes intuitive 103, 104 ' 105 ' 106 - Id lhc case ot ' concentric circles, the 

graphic clues as to the set relationships indicated bv the ^« e "DOst circles 91, 92 would be denned to have the 

symbol. Further, because all of the binary Boolean logical 10 Sr eale st probable relevance. Similarly, the apex of a triangle 

relationships may be represented by a" single icon, the 102, 103 or Py ramid 96 > 97 would be defined to have the 

present invention has the potential for greater efficiency than greatest probable relevance. In each case, the GUI would 

other schemes which have a more limited number of sym- allow lJie user t0 P oinl to and click on an y rin § or sIic e, or 

bols or require elaborauon of large sets of discrete svmbols. 10 dra§ t0 £ etner (define as active) two or more sections. 

As shown in FIG. 1C, these symbols or icons can be made »5 According to one embodiment, the user would have single 

to interact with each other through the use of graphic logical ste ? ava fl«bility of functions to see all results or to view each 

connectors. Optionally, available to the user is a diagram- successive layer of results, simply by touching one key or 

malic form of Boole Logic, as shown in Column F. As shown swilch or via a sin § le P oimin 3 device command, the result 

in FIG. IB, the logical symbols of the invention arc accom- ! ayers bein 5 ordered b >' a sequence of decreasing or increas- 

panied by a range of" pictographic symbols, depicting 20 m S relevance. The rules guiding selection of a ranking may 

actions, choices, processes, results and warnings, all univer- be ex P liciti >' S* ven by the user or instead, given by the 

sally understandable. ~ computer, or based on experience, artificial intelligence, or 

adaptation of the computer to the user's preferences. 

EXAMPLE L Because the present invention provides for a formalized 

As shown in FIG. 5A, the graphic icon representation 25 notatioa thal can be made to correspond on a one-to-one 

according to a first preferred embodiment of lhe invention ba5lS 10 lhe pnor arl ' lransiaLion t0 acd &om the noiational 

provides an intuitive presentation of the Boolean relation- syslem accordm S 10 lhe P reser *l invention may be provided, 

. ship of two sets, in the form of a stylized Venn diagram. The and 1 re P resenta tion in one form may also be represented in 

stylized icon includes a bounding" rectangular box 51, two lhe olher form as a & nclion oE the system according to the 

intersecting circles 52, 53 of approximately equal' size 30 prcsent mvenllon - An embodiment of such a translation 

spaced along the elongated axis of the rectangular boundarv mechanism, shown in FIG. 10, provides a split screen or 

51, and lines representing the two inputs 54, 55 and output muil iP Ie window display 110 with simultaneous represen- 

56 of the set operation lalion of the Boolean Graphic Interface 111 according to the 

As shown in FIG. 5A, arrowheads may be provided to ™?Tu U equiv f n ' tradtonal Boolean 

indicate inputs and outputs; however; these are not 35 ** n T"n? nV ^ ^P c - A C J M ^ or in one 

necessary, and in fact, a modification of a succeeding search ^ ' Propagated to member 112, 111, 

scope parameter effecting another set inclusion propertv S^^JT 10 USC eUbc , r u f fa "; °r both for input, 

may back propagate to alter that linked set inclusion prop- T^^?^?'^h tn ^^^u 113 

ectv. Thus, the lines indicate linkages, and do not necessarily ° f lradlt tonal Boolean notation according to the present 

indicate a unidirectional information or control flow. ' 40 m ^ c f on - ™» "nmediate translation allows use of the 

~. ic , • , , . interface for database systems which accept onlv traditional 

The icon defines logical dala regions representing a Boolean no[alion 112 { By movi a {n{{ dev{ 

C nmnt Q /d r l °T r P l i ™**"?h °/ C0U !^ m ° re such as a mouse, or touch serein, any part of the interface 

a single step, as shown in FIG. aB, which includes four desired 

intersecting circles 57, 57", 57" and SV" within rectangular 4 Trt ™ nef „,„» , « u i » 

boundary 58, but these are not preferred for general h F ° * ^ USef SdeCtS a " ^ ^"h may 

purposes and are preferably onlv used where a large numbe 1, ! "7 "** ^ " pr ° S ^ mmed WUh lhc des - d 

of sets are combined using a relatively simple operation, ^T.T^^^^T^ ^ ^ ? mmon[ V 

such as AND or OR H ' elected characteristics. In this manner, entire predetermined 

, . * 50 search strategies may be iconized and emploved in subse- 

FIG. 7 shows a computer interface of a computer employ- quen t searches. In the case of a generic icon, 'the inputs are 

ing the graphic icons, tor performing a complex search. In lh en denned. In manv cases, this task is most expeditiously 

tins case the search is a chain ot hve binary criteria 71, 72, performed as a manual set definition in known manner, i.e., 

73, 74, 75, with 6 input sets. A pointer 76 is shown the user types set inclusion criteria into a text dialog box, and 

proximate to one oi the icons 73, which, for example is used 55 pre sses enter, for each of the inputs. The user then selects the 

to select a region within the icon, manipulate a size or desired output function as a relation between the inputs, for 

position ot an icon to select for analysis such as ranking, cxam pi e by defining a Boolean relationship between the 

sorting or biockmodelkng, or to position the icon itself sets. While this step may also be manuallv performed, the 

Below each icon in FIG. 7 is a pair of bar graphs 77, 78, pointer of the interface may be used to select regions of the 

79, 80, 81. A first bar graph 77', 78', 79, 80', 81'represents a oo icon representative of the desired set spaces. The output may 

number of set members which match the indicated inclusion be further used as an input to another operation 

criteria, while a second bar graph 77-, 78", 79% 80" 81" In order to tune the search operation, a number of options 

indicates a ratio of desired set members which meet an are available. ITie interface optionallv provides an indicator 

additional criterion to set members which do not meet the 0 f the size and ratios of the resultant 'sets. As shown in FIG 

additional catena. 65 7f these may be prcsentcd as bar graphs 77< 78 79> 80 81 

In another aspect of the invention, flexible output icons Where the number of elements of the set is different from the 

are employed to represent results ranked as to probable user's expectations, this may be changed in a number of 
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ways. First, the input set definitions may be broadened or NOT A 

uarrowed to iuclude more or fewer elements. This change NOT B 

may be performed manually, by changing the set inclusion A XOR B 

criteria, or through an artificial intelligence or semi- NOT (A XOR B) 

automated means. In an artificial intelligence scheme, the 5 A OR NOT (A OR B) 

computer system analyzes the set inclusion properties and, B OR NOT (A OR B) 

based on rules, neural networks, or fuzzy logic, proposes a NOT (A AND B) 

change in the set inclusion properties which would likely A OR B 

have the desired effect. Where the database is readily acces'- A OR NOT A 

sible with small penalty, an artificial intelligence system may io A selection is thus received from the user of one or more 
be used to probe the database with one or a series of changes regions within the boundary to define an output data set and 
to detennine the effect. In this way, both the specific change presenting the generic graphic icon on the GUI as a first 
type and amount may be analyzed. Thus, the user, in this modified graphic icon, having visual indication of the 
case, instructs the artificial intelligence system of whether selected regions corresponding to the defined output set. In 
the set inclusion criteria are too broad or too restrictive, and 15 addition, as discussed above, a set inclusion property is 
the system thereafter analyzes the search strategy and pro- denned to correspond to each data set for the first modified 
poses a change. In a semiautomated system, the interface graphic icon. Where a generic icon is not employed, to be 
applies far less intelligence to assisting the user, and there- modified by the user, an icon representing a set having 
fore allows the user a finer granularity of control. In this predetermined characteristics may be selected from a pick- 
case, a selectable criterion for altering the set inclusion is 20 list or tool bar. These predetermined icons may provide 
defined, or selected by default. For example, a proximity advanced functionality, not easily available in the generic 
operator between terms or a numeric interval defined. Thus, icon interface, such as chronological functions, custom 
the user manipulates an icon to indicate a relative change in database interfaces, and the like. The predetermined icons 
this operator or interval, and the set inclusion criterion is may be user modifiable. 

correspondingly updated. 25 Where a user selects a Boolean function by selecting 

A second way to narrow a set is to rank the members and regions of an icon, those regions may change color or image 

apply a threshold or cutoff based on the ranking. Often, the density. 

ranking criterion is related to the set inclusion criterion, but After the individual input sets are defined, and an output 

need not be so. Where a set inclusion criterion is defined by relation of the sets defined, the output may be suitable for 

other defined sets, these rankings may be applied to these 30 review by the user as an input set to another search criterion, 

sets before a subsequent set operation, and the ranking The icons are chained together as elements in a flow chart, 

criterion may include the set inclusion criterion from the D .v graphic connector lines which connect the inputs and 

antecedent set definitions. On the other hand, the rankings outputs of the iconic representations. These connector lines 

may also be applied after the composite set operations or ma Y oe smart, i.e., intelligently routed, spaced and sized in 

using extrinsic criteria. 35 accordance with the overall search strategy, graphic presen- 

To broaden a search, for example a text-based search, a lation and P a S e or presentation layout, 

thesaurus may be applied to include words having similar Generally, the entire search sequence will be maintained 

meanings to the search terms. Further, spelling variations durmg lhe scssion * so that & e user ma y review various 

and word root analysis may be used to identify variants in P uts and 0Ul P uls and reverl t0 P ri o r state s if this is 

Athird way to narrow a set is a applv additional criteria, *° » a . s ^ ue nt state. T^us the user may be able 

such as additional composite set functions. ° tCSt of the *** or s f arch refinements without 

. * losing valuable intermediate results. 

This search scope alteration process may be applied to where a variable operator in the set inclusion criteria is 

any set definition, individual or composite. Thus, within an identifiable, this provides the opportunity to alter the scope 

icon, the scope ot any region may be modified. Where the 4S of the sel> 5y simplv varyi lhis operat * or . where nTime ric 

region corresponds to a defined set, preferably any change in dala ^ uscdt [his 0 p cnuor mav be a lhieshold or rangtt - m 

set properties is propagated back to the earlier set definition. which lhe set mem bers reside, 'and thus the set composition 

However, this back pro propagation may be undesired where may be varied by simply changiQg thc cutoff or range. Thus, 

a given set output is applied to multiple inputs, and where a as shoWQ in nG> 8A(1) , an icon may 5e providcd with a 

change for one subsequent operation is not necessarily so simple numeric indicator 84 representative of the parameter, 

desired tor another subsequent operation. allowing simple graphic manipulation by selecting an up 

The preferred icon, as shown in FIG. 5 A, is a generic arrow 85 or a down arrow 86, indicative of an increase or 

Venn operator icon, divided into 4 set spaces 59, 60, 61, 62, decrease in the parameter. On the other hand, in non- 

with two intersecting circles 52, 53 in a rectangular box 51. parametric data, such cutoffs may be somewhat more difli- 

The space within each circle and not intersecting 60, 62 55 cult to apply, as shown in FIG. 8A(2), wherein semantic 

defining two set spaces, the intersection 61 defining a third variable(s) 87, 87' are indicated and the user modifies a 

set space, and the external area 59 defining a fourth set weighting or boundary by selecting up 85 or down 86 

space. These set spaces, are independently selectable, allow- arrows. Even in these cases, however, such parameters 

ing definition of the various Boolean binary operators of the which have a monotonically increasing value over a range 

elements of sets, as shown in FIG. 6: go may still exist. These include proximity operators, number 

null set of occurrences of a search term in a set member, root 

A NOT B analysis of search terms, and semantic variables. 

B NOT A For example, as shown in FIG. 8B, with respect to a full 

NOT (A OR B) text proximity operator, the size of a circle 82 represents the 

A AND B 65 operator for a single set inclusion rule, which is more 

A generally a continuous or stepwise continuous variable. For 

B example, a truncation, semantic analysis, thesaurus, or rel- 
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evance rule may be applied based ou ibis operator. The set system may include a syntax checker and/or an expert 

inclusion operator need not be unidimensional, and in ihe assistant system for helping to define an initial search, 

case of a multidimensional set inclusion rule, the user will, As stated above, a graphic indication mav be presented to 

for example, double click on the graphic object or portion of the user of the results of the search, for example magnitudes 

the object to display a multidimensional graph, such as a 5 and ratios of relevant set members. These may be presented 

circle having a plurality of spaced radially oriented axes, as bar graphs, or other known types of graphic indications, 

e.g., a star graph. The shape of the star graph may be altered After a search strategy is defined, the user may request the 

by resizing any of the radially oriented axes. system to perform an" analysis of the search to determine 

As shown in FIG. 8C, the parametric (or non-parametric) whether search terms are redundant, contradictory, or overly 

icon may be provided as a single input function, applying a 10 or under ly inclusive. This analysis may be done by a logical 

threshold to the represented data set from the Venn operator analysis of the search itself, or by an analysis of the set 

icon 88. members differentiated by the various criteria. The results of 

Likewise, the parametric (or non-parametric) icon may this analysis, if performed, may be presented to the user for 

represent a multiple degree of freedom operator 89, such as action, or used to automatically modify the search. Where an 

mean 90, 90'. and range 113, 113', as shown in FIG. 80. Two 15 automatic modification is provided, the system acts in accor- 

degrees of freedom may represent a practical limit for dance with general guidance provided by the user. For 

allowing feedback of the criterion 113, while greater num- example, where the user inputs an inferential search which 

bers of degrees of freedom are better represented in a full is translated by a semantic parsing engine, the system 

dialog box 115 with, for example, individual slide controls according to the present invention may look to the original 

116 for each degree of freedom. 20 inferential search query to determine an intent of the user, 

Where the icon or graphic representation is altered by a and may override the semantic parsing engine, without 

manipulation, it is often helpful for a window to display a intervention by the user. On the other hand, where the user 

corresponding change to the explicit search expression, and specifically inputs a search query, the system must generally 

for changes to propagate through the entire search strategy seek guidance from the user before' altering the search 

after being finalized. A multiple level "undo" and '"redo" 25 strategv. 

capability is therefore helpful. In Internet applications, this This' guidance may come in the form of responses to 

process is usually available through ''backward" and "for- computer-generated queries, or user manipulation of icons 

ward" icons. in a manner which is interpreted by the system to allow 

Where the database is easily and efficiently accessible, it alteration of one or more search expressions. This manipu- 

is preferred that the search strategy be updated after the user 30 lation includes selection of portions of graphic elements, and 

makes a search strategy modification, so that feedback may repositioning or resizing oh the display with respect to the 

be obtained immediately. On the other hand, where a penalty other graphic objects, to represent a set operation to.be 

is incurred by accessing the database, it is preferred that the performed on the data set. Each set representation includes 

user be given an opportunity to make all desired changes as graphic objects which are "selectable** and modifiable, 

a group before submitting the search. Therefore, the user is 35 meaning the graphic object may be operated on separately 

given this choice or the system obtains this information and from other graphic objects, but'more importantly, any new 

acts accordingly. data sets resulting from the set operation are represented by 

According to the present invention, new interface con- a secondary (or tertiary, etc.) graphic object which is also 

structs need not be implemented, and therefore known GUI "selectable". Further, these primary and higher order sets 

elements, such as buttons, dialog boxes, multiple view 40 can also be graphically merged or re-merged by a drag 

windows, pull-down or pop-up menus, button bars, status technique. 

bars, context sensitive pop-up windows, scrolling and drop- In a complex search, the chain or operations mav be 

down lists, scroll bars, garbage can icons, blow-up and formidably large. In a preferred embodiment of the 

shrink-down, and the like, may be usefully employed. invention, a user, when viewing a particular part of a chain 

Therefore, typical GUI system application development 45 of operations, can zoom in (enlarge) or zoom out (contract) 

tools and libraries may be used to implement the present in relation to an area of interest, or create a fish-eye view 

system. with a f 0CU5 on lnat area? { e ? provide selective detail on a 

The overlap between two circles 82, 83 represents a set portion of the graphic space, 

inclusion operator between the two sets represented by the The present system is not limited to searching data sets 

circles, and a unidimensional rule may be altered by moving 50 using crisp logic, and may therefore include fuzzy logic 

one circle with respect to the other. Again, in the case of a searches having corresponding set inclusion properties and 

multidimensional set inclusion rule, a multidimensional set member rankings. In this case, the manipulation of 

graphic representation may be accessed by a secondary graphic objects need not be limited to binary relationships, 

operation. In general, in each case, one designated default and therefore an amount or proportionality of movement 

operator will be represented by the unidimensional alteration 55 may be used to indicate a set inclusion propertv or set 

of the iconic representation, while some or all of the set function. The manipulation of the graphic element, in this 

inclusion properties may be accessed by way of the second- case, may be used to alter a boundary function of a fuzzy set. 

ary operation. It [ s noted that search strategies may be modified implic- 

Thus, to generate a query, producing a resultant data set, itlv by methods other than graphic manipulation. For 

the user first defines a plurality of set inclusion criteria, using 60 example, a user may define a desired result parameter, which 

any known method, including explicit definition of these the system then uses to tune the search strategy to achieve 

criteria. These criteria may be schematic or imprecise, and this parameter. For example, a user mav seek the 100 "best" 

indeed an advantage of the present invention is its ability to results with given criteria, and thus the system mav then 

graphically assist the user in refining the search after initial analyze the database to determine an optimum "search 

entry; however, the initial search should be valid, so that it 65 strategy, which is then implemented, 

may be used as a point of reference for direction of modi- Artificial intelligence may be applied to analyze the 

fications. For this purpose, the search expression entry composite set inclusion criteria, and to propose an analogous 
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change to the search which corresponds lo the gesture or 
indication of the user. Thus, in a full text proximity search- 
able system, a search criteria A AND \\ might, lor example, 
he narrowed in the order of A (same section) B. A (same 
paragraph) B. A (same sentence) B. A (adjacent) B. 
Alternately, the word spacing between A and B may be 
narrowed by successively reducing the operator, A (within n 
words of) B. Using, artificial intelligence, these various 
schemes may be intermixed or hybridised, and indeed, the 
search strategy may be tuned based on a number of reported 
hits, if Uhi low, the strategy is made less restrictive, if loo 
high, it is made more restrictive, according to the hierarchy 
of the search. 

Where a result is requested based on a number of "best" 
set members, a ranking criterion may he applied, (icncralty, 
such a ranking requires, a more thorough analysis or" the set 
members, although simple ranking schemes are also known. 
For example, date, size, numeric or alphabetic ordering and 
the like may be- applied as appropriate. 

In analyzing system output, a user may review output set 
membership, and manually include, exclude or rank mem- 
bers. The system may then formulate an "intelligent agent" 
which extracts a ranking or inclusion rule to complete ihe 
task or to search other data spaces Lor relevant data. 

When ranking a set member, the primary set inclusion 
criterion is fulfilled, and only then is a secondary criteria 
applied lo order the set members. The ranking is generally 
applied where the next stage of the process truncates the first 
set based on this ranking, for example taking only a certain 
number of set members or those having a ranking score 
above a certain level. In some instances, the ranking is 
directly related to the search criterion, i.e., the ranking is 
along the same scale as the limiting variable. In other 
instances, this ranking is truly a secondary operation ami has 
little to do with the primary criterion, such as dale ordering, 
or other criteria extrinsic to the primary search criterion. 

The interface system may. as a default, present set mem- 
" bers to the next stage or output according to the inclusion 
criteria, with a weighting test where muitivariablc inclusion 
criteria are employed. The user then may modify this 
ranking by. Lor example, opening a text dialog box or a 
graphic window. In certain instances, the ranking criterion 
may be defined by graphically manipulaling an icon or 
graphic image in corresponding manner to the set inclusion 
criterion, while in other instances it is preferably defined 45 
explicitly by the user. Thus, m a multivariate set inclusion 
criteria, the user may be presented with a star graph icon for 
defining the relative weighting of each of the multiple 
variables in the rankings. 

Once the sol inclusion and ranking arc defined, these may 
be manipulated in multiple set operations. Preferably, sets 
arc handled two at a time, with a simple two set Venn 
diagram representing the opera lions. In some instances, 
however, it may be desired 10 have a larger number of sets 
- combined in a single operation, and more complex icons 
may Ix: provided for this purpose. A four input set operator 
icon is provided as shown in FIG. 5B. In this icon, each 
circle has ai leasl ihree degrees of freedom, X-axis, Y-axis 
and radius, representing a distance from a vertically aligned 
circle, a horizontally aligned circle, and a sei inclusivencss 
function, respectively This, however, does nol allow an 
easily accessible relationship to l>c defined with respect to a 
diagonally oriented circle. 

Thus, a multiple set icon is preferably employed only 
where a small amount of control over the output is required, 
for example for relatively simple set relations, such as AND. 
OK. and NOT, to nine set inclusion properties of a set. and 



rank the resulting output set. Alter defining the Venn set 
spaces selected for the output by the Boolean logic 
operations, the area representing portions of the set space 
may be relatively enlarged or contracted by graphic mauipu- 
< lation of ihe boundaries of the region, for example. In like 
manner to the single set operations, a continuous or stepwise 
continuous variable is altered based on the graphic manipu- 
lation. Also in like manner to single set operations, a default 
variable is established with potential for override by the user 
and manipulation of multiple variables. Where the search 
iiself does not allow or suggest such a variable, the user may 
request the user lade fine how he or she would like the search 
lo be modified. The output, set may be ranked according to 
default criteria corresponding to the input sets and/or the" set 
operation between the multiple sets. 
15 While, as discussed above, the tuning of the single set 
inclusion is separated from the multiple set operations, it is 
also possible to lune both the input and output scl inclusion 
pa)perties by graphic manipulalion of a multiple set icon. 
Thus, since each input set is signified by a bounded region 
in a Venn diagram, the border of the bounded region may be 
enlarged or contracted lo alicr the scl inclusion property. 
Further, any region or regions may be selected for attempted 
manipulation, including the region outside the defined set 
spaces, the intersection and non-overlapping portion of each 
25 set. Likewise, the ranking of the each region in the Venn 
diagram, or composite regions, may he ranked. 

Where an icon representing multiple sets is employed, an 
additional gesture is available for indicating to the interface 
that a manipulation is desired, Therefore, noi only may ihe 
size of a region by manipulated, bui also the relative spacing 
and arrangement of the spaces. For example, where the 
intersection space is relevant lo the definition of the output 
set. a relative movement of one set boundary away from the 
other may be interpreted as narrowing the inclusion criteria 
for the intersection. Likewise, a movement toward one 
35 another may be interpreted as an instruction to broaden the 
set inclusion criterion. 

In the case of Venn diagrams in this paradigm, a range of 
graphical modifications are possible, representing different 
types of possible set operations. For example, set A may be 
ao defined as word X within n words of word Y in a full lext 
database search. The initial search may be performed with. 
n-5. In this case, a change in size of the boundary may be 
interpreted as changing n, so thai a larger boundary increases 
n while a smaller boundary decreases n. The number of sei 
elements which meet the set inclusion criteria may be 
indicated, to allow feedback as lo the effect of the modifi- 
cation. In like manner, a multidimensional change in size or 
metric (e.g., shape) may be implemented where the sei 
inclusion criteria are more complex. On the other hand, 
where two sets. A and B are defined, their spatial relationship 
in the graphic representation may be interpreted as their 
degree of overlap. By changing Ihe spatial relationship of the 
representations of set A and B, the set inclusion criteria for 
the underlying sets as well as their union and intersection 
55 may be controlled. In some instances, this is impossible, for 
example where the represented set inclusion criteria do not 
include any continuous or stepwise continuous formulation. 
In this case, the user interface rejects an attempt 10 graphi- 
cally modify the configuration of the graph. For such a 
60 change lo be made, the user musl a ller the underlying 
concrete scl inclusion criteria or define a subsidiary ranking 
method. On the other hand, ihe overlap between two sets 
may have an element of gradation, and therefore the posi- 
tional manipulalion would be interpreted as changing a 
threshold, ranking criteria, set inclusion property, or another 
aspect, as possible given die underlying set inclusion prop- 
erties. 
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Where a aumber of possible interpretations of a manipu- domain name may be useful information. Funher, the i; cua- 

lation are possible, the interface may guess the most appro- lions" of the citation indices on the Internet Usenet corrc- 

priate interpretation or seek elucidation from the user. Thus, spond to the posting threads, which allow related postings to 

the user may enter set inclusion criteria which result in 100 be grouped together. Of course, other types of mass indexed 

''hits" or data records or portions of records which corre- 5 databases may be used to rank the set outputs, where the set 

spond to the criteria. The user, however, may seek an output space has a significant relation to the database from which 

including only 50 such hits. Therefore, the user selects the the index is created. 

bounded region of the GUI which corresponds to the desired While for many envisioned set definitions, a Venn 

bits, and graphically seeks to resize the area. Where the diagram, defining Boolean logic spaces is sufficient, in other 

search criteria, includes a single continuous or stepwise io cases, other data relationships may be desired or required, 

continuous parameter, the interface may interpret the resiz- Thus, alternate or additional symbols may be employed. Tor 

iug as a command to vary this parameter. Where the result- example, a time delay or relative time 'difference may be 

ing data set is ranked, i.e., where there is an algorithm for represented by a clock in a box-icon; exclusive or (XOR) 

determining a relevance or importance of elements which represented by two circles with one or two lines separating 

meet the set inclusion property, the resizing may also be is them, in a rectangular box; and a merger or mixing repre- 

used to vary a threshold, below which the data is not sen ted by a funnel, 
considered within the boundary. Where there are a number 

of possible analogies to the resizing, the interface may seek EXAMPLE 2 

specific guidance from the user, have different resizing . . - . - . . 

commands, employ a multidimensional resizing command, 20 l f* ad ° f < ntenn S *>< definitions, as provided m the 

with the various' dimensions representing the various meth ° d aCC ° t rdmg 10 E * ample l ' \ user detlne , S a MMdl 

degrees offreedom to modify the set inclusion properties, or usm § P nor knowa techniques, such as natural language 

the interface mav guess the correct interpretation, usins a set *f rchin S> ^? b V exam P le . ™ other known techniques, 

of artificial intelligence rules, an adaptive interface, or ^sing these teenntques, a searcn expression is constructed, 

contextual analysis of the data sets or data space. Of course, 25 USl ^ Boolean an ] a u olh u er aala relationships, which is 

a combination of these strategies mav also be used. n u ° rmaliy nm ^ployed b J u *er. However, according to 

Another aspect of the present invention provides a rank- lhc P" sent ™™Uon, this derived search strategy is pre- 

ing algorithm based on an extrinsic database, such as a «nted as a senes of icons mterconnected in accordance ^th 

known citation index, such as the Science Citation Index or the prcse f ™™*™> ««* » 15 ^own in FIG. 7. Thus, the 

Social Science Index, which includes major scholarly 30 US * r need not "P 1 ™ 1 ? d ^ ne thc « arch Parameters but 

references, with indication of the identity of the authors, rathcr ' 10 aut ° maled ° r mtc " l S cnt S ^ era ma >' bc tm V ]o ?< 6 

affiliated institution, journal, cited references, title, and t0 generate the mitial searcfa " • 

possibly abstract and key words. Such an index, when After tne ^ck is formulated using the known prior art 

employed to rank the relevance or importance of the results techniques, the user is presented with the results. While 

of a search in another database, provides a sophisticated 35 results may sometimes bc acceptable, often the user 

means for evaluating references. Tvpicailv, a search of desires a search of differing scope than that presented, 

various topics will yield hundreds or thousands of "hits". In Therefore, the interconnected icons may bc manipulated. to 

this case, it is desired to present the ''hits" in a logical order, provide a degree of explicit control over the search expres- 

so that relevance or importance declines as the list is sion ' For example, search limitations may be eliminated or 

reviewed in order, allowing a truncation of review of the 40 b yP assea D V eliminating an icon or selecting the output to be 

search results without reviewing all of the references and ec * ual t0 one of the in P uls - Further, the logical relationships 

allowing a cutoff to be less arbitrarily imposed. Thus, for ma y bc modified by selecting or deselecting set 

example, if literature review references are desired, the sort relationships, i.e., Boolean set spaces. According to the 

or rank criterion is number of cited references. If scientific P«scni invention, parametric search expressions may be 

importance is desired, then the number of citations to that 45 allered b y manipulation of individual icons, as discussed 

reference is the son or rani; criterion. Other criteria which above. 

may be employed include the importance of the journal, Where the screen display is generated based on a search 

which may be defined by the user or derived from a strategy indirectly inputted by the user, and sometimes even 

statistical analysis of the citation database itself, the impor- where expressly inputted by the user, it may be desirable to 
lance of the institutions) with which the authors are 50 include on the main display the individual search expres- 

affiliated, the number of cites to articles by an author, or any sions. Thus, proximate to the icon representing the search 

other single criterion or hybrid of multiple criteria. Each of expression is a text or data window presenting the search 

these rankiug criteria may be derived from the citation expression itself, similar to that shown in FIG. 10, or a 

database, thus allowing standardization and ease of updal- screen button for easily accessing the search expression, 
ing. Thus, while such a system has certain limitations, it 55 This will allow a user to view both symbolic and concrete 

provides an additional tool for ranking of references. The expressions in the same view. In this case, the graphical 

use of citation indexes is especially useful because databases representations and the text or data representations are 

of such information are available on-line and on CD-ROMs, preferably linked, so that where one is updated or changed, 

making implementation possible without having to amass the corresponding other is also updated or changed, 
this data separately for implementation of the interface 60 Likewise, where multiple search expressions are chained or 

system. linked, a change in any one preferably propagates through to 

Likewise, where relevant, Internet Usenet postings pro- the output, 

vide similar opportunities, such as by employing "DejaN- The changes that a user makes to a search may be 

ews" (http://www.dejanews.com), ! * AltaVista" (hup:// indicative of the users desires and characteristics. 
www.altavista.digital.com) or other search engines which 65 Therefore, an option is provided for, the system to leam from 

allows a search for all posts by an author. In this case, the user in order to tailor future searches in accordance with 

institution information may be less reliable, however, the user's desires and characteristics. Gene rallv, the user will 



